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Abstract —The possibility of latency arbitrage in financial 
markets has led to the deployment of high-speed communication 
links between distant financial centers. These links are noisy and 
so there is a need for coding. In this paper, we develop a game- 
theoretic model of trading behavior where two traders compete to 
capture latency arbitrage opportunities using binary signalling. 
Different coding schemes are strategies that trade off between 
reliability and latency. When one trader has a better channel, 
the second trader should not compete. With statistically identical 
channels, we find there are two different regimes of channel noise 
for which: there is a unique Nash equilibrium yielding ties; and 
there are two Nash equilibria with different winners. 

“One guy says to me, ‘It doesn’t matter if I’m one second slower 
or one microsecond; either way I come in second place.”’ 

— Flash Boys: A Wall Street Revolt [T] p. 63] 

I. Introduction 

The traditional view in information theory is that commu¬ 
nication reliability is to be obtained by paying a penalty in 
latency. Though there is growing interest in finite blocklength 
information theory 0 - 13 , motivated by general notions that 
latency is problematic, few works have put forth explicit 
reasons for needing to meet latency constraints, cf. Q. 

When there are latency arbitrage opportunities in financial 
markets, however, latency is a key performance metric 
One must be faster than one’s competitors. Indeed, the speed 
of light is a key constraint and physical distance a key consid¬ 
eration when building communication channels, whether fiber 
optic or microwave m, 0. 

When prices of the same financial instrument in different 
financial markets (say Chicago and New York) diverge for 
a short period of time, this is called a latency arbitrage 
opportunity. Low-latency traders jump in and trade away 
the price discrepancies. As an example, a low-latency trader 
would sell pork bellies in the market where the commodity 
is temporarily overpriced, while simultaneously buying it in 
the distant market where the commodity trades too cheaply. 
In the process, the demand and supply produced by the low- 
latency trader equilibrates market prices in markets that were 
previously divergent. Then, the low-latency trader quickly 
reverses his position to capture the gain, and investors of 
all frequencies can be assured that prices on traded financial 
instruments are consistent across the globe, upholding what is 
called the law of one price 0. 

What are the communication requirements for a latency 
arbitrageur? Some unreliability is allowable since there is not 
a need for perfect information to make riskless profits; there 


is only need to skew the odds systematically in one’s favor HI 
p. 74]. A recent article noted ill: 

that high-frequency traders would much rather have 
access to a communications channel that’s faster 
than every one else’s, even if it gets flaky every now 
and then. A link that’s second or third fastest isn’t 
of much use to them, even if it’s always available. 
That’s a very different calculus than the one most 
engineers use-hut it’s clearly the one you want to 
follow if you’re trying to get ahead of the pack. 
Designing for such novel requirements suggests new problems 
at the intersection of communication theory and game theory, 
and provides a concrete reason to step back from infinity. Some 
previous work on low-latency communication did not consider 
the competitive nature of the communication ||9l- 

One should be careful not to confuse low-latency trading 
with “high-frequency trading.” Low-latency refers to the abil¬ 
ity to quickly route and execute orders irrespective of their 
position-holding time, whereas high-frequency refers to the 
fast turnover of capital that may require low-latency execution 
capability 0. Although academic study of low-latency trading 
has been limited, Moallemi and Saglam put forth three main 
reasons for wanting low latency in trading HOl: 

1) Staleness: A trader with significant latency will make 
trading decisions based on information that is stale. 

2) Relative Latency: A trader will want to act to exploit 
a discrepancy before a price correction takes place, i.e. 
before competitors are able to act. 

3) Ordering: Modern markets treat orders differentially 
based on time of arrival. 

They focus on a model that captures staleness, but do not 
consider relative latency. The goal herein is to look at relative 
latency through a mathematical model of repetitive binary 
signalling that is essentially a decoding game, where decoding 
time is the strategy. 

Our main results show that when one trader has a better 
channel, the second trader should not compete. With sta¬ 
tistically identical channels, we find there are two different 
regimes of channel noise for which: there is a unique Nash 
equilibrium yielding ties; and there are two Nash equilibria 
with different winners, respectively. A question commonly 
asked by market participants and regulators alike is how 
much speed is enough? Clearly, the race for speed will end 
when there is equilibrium: when an additional dollar spent on 
technology no longer generates extra return 0. Our results 
also take steps to answering this question. 


II. Problem Formulation 

We put forth a stylized model of low-latency trading. 

A. Arbitrage Payoffs 

Let us assume there are two hrms A and B who want to 
exploit latency arbitrage between markets I and II. Both firms 
have agents in both markets. There is a single commodity 
under consideration and the values of trades are hxed. Hence 
there are just two types of messages: “Buy” and “Seir’Q The 
message is transmitted by the agent of each firm in market I. 
The agents in market II try to recover their message correctly 
with high probability and as fast as possible. Payoffs are 
determined by the order in which the hrms act. So we describe 
payoffs accordingly. 

• First Firm: The hrst hrm to act in market II based on its 
estimate of the message has the following payoffs: 

- If the transmitted message is “Buy,” and the agent 
in market II correctly recovers it, the hrm earns Vi 
dollars. If the agent makes a mistake and assumes 
the transmitted message was a “Sell,” the hrm loses 
Vi dollars (i.e., the payoff is —Vi dollars). 

- If the transmitted message is “Sell,” and the agent 
in market II correctly recovers it, the hrm earns V 2 
dollars. If the agent makes a mistake and assumes 
the transmitted message was a “Buy,” the hrm loses 
V 2 dollars (i.e., the payoff is —V 2 dollars). 

• Second Firm: Since we assume the market has equili¬ 
brated after the hrst transaction, the second hrm decoding 
the message will have zero payoff whether or not it 
recovers the message correctly. 

• Simultaneous Decoding: If both hrms perform decoding 
and act upon it at the same time, each of them receives 
half of the payoff that the hrst hrm would receive. 

• Efficient Market: We assume that after time Tg, the 
market becomes efficient through the action of high- 
latency traders and the arbitrage opportunity disappears. 

We assume the prior probabilities Pi and P 2 for the messages 
“Buy” and “Sell,” where Pi + P 2 = 1- Furthermore, each 
transaction has a cost for any hrm since there are real costs 
associated with accessing an exchange, ranging from payment 
for direct market feeds to managing routing logic: we assume a 
cost of c dollars per transaction. Further, we assume that each 
hrm pays dS dollars to use a channel with signal-to-noise 
ratio S'; this is a linear function of signal power as in most 
physical communication transmitters. The maximum signal- 
to-noise ratio of the channel is Sg. 

Clearly, this is a hrst step towards modeling the payoffs 
in high-speed trading in hnancial markets. There are other 
possible ways to model payoffs depending on the market under 
consideration. 

*In the real system, messages are transmitted via a Financial Information 
exchange (FIX) protocol. A typical FIX message is composed of a header, a 
body containing order execution directives, and a checksum m 


B. Strategy Space 

The strategy space of each hrm is a nonnegative integer, 
denoted by Ta and Tb, respectively which is the delay in 
before decoding the message. Specihcally, the decision time 
of hrm A, Ta, and the decision time of hrm B, Tb are the 
times when the agents in market II of each hrm decide to 
decode the message and act based on their estimates of the 
transmitted message. The fundamental speed-accuracy tradeoff 
can be parameterized by T: the larger the decoding delay for 
a hrm, the smaller probability of error it has on average. Thus, 
the expected payoff would be higher, if it is the first firm to 
decode and act. But as the delay increases, it gets less likely 
for a hrm to be the hrst actor. 


C. Communication Scheme 

For concreteness, let us suppose that we are communicating 
over a discrete-time AWGN channel with power constraint P. 
The noise power is assumed to be Nq, so that the signal-to- 
noise ratio of the channel is S' = P/Nq. 

We assume that each agent in market I uses binary phase- 
shift keying (BPSK) repetition coding. Admittedly, this is a 
very simple model of the communication channel and coding 
scheme. But we believe that studying this model captures the 
essence of the latency-accuracy tradeoff that is of interest to us. 
If the message is “Buy,” the agent sends -fa into the AWGN 
channel; the received symbol is a normally-distributed random 
variable Af{+a, Nq). Similarly, if the message is “Sell,” the 
agent sends —a into the AWGN channel; the received symbol 
is a normally-distributed random variable A/^(—a, Ag). The 
power constraint implies the optimal choice a = \fP. 

The agent at market II estimates the transmitted message 
after receiving T symbols, yi,..., yx. The sufficient statistic 
of the received symbols is ^ Vi- decoder compares 
this sufficient statistic with a chosen threshold, h. The optimal 
method to choose the threshold will be discussed later. Thus, 
optimal decoding is: 

-1 ^ Buy 

y I] y* I 

^ i=l Sell 

There are two types of error events: 

• If the transmitted message was “Buy” and the decoder 
decided “Sell,” we have the error event of type 1. The 
probability of this error event is: 


^^i = P(error|Buy) 
1 T 
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.-hVP, 
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where Q{-) is the cumulative distribution function of the 
standard normal distribution, and 


7i = (l-h)V2. (I) 






• If the transmitted message was “Sell” and the decoder 
decided “Buy,” we have the error event of type 2 . The 
probability of this error event is: 

Pf^2 = IP(error|Sell) 

1 ^ 

= > /i/P|Sell) 

= = Q( /t—( l + /r)2) 

^ VTiVo 

= Q{y/ 2 TS^ 2 ) ~ exp(-rs'72) 

where 

72 = (1 + hf/2. (2) 

D. Expected Payojf 

• Expected payojf of the first firm: As mentioned, with 
probability Pi the transmitted message is “Buy” and with 
probability P 2 , it is “Sell”. The first firm decodes the 
transmitted message based on its own estimation of the 
message and makes a transaction (and therefore pays the 
transaction fee of c dollars). Thus, the expected payoff 
of the first firm is as following: 

E [payoff of first firm] 

= E [pay off I being first] — c — dS 
= PiE[payoff|message is Buy and first] 

+ P2E[payoff|message is Sell and first] — c — dS 

= Pllil - Pe,l)Vl + Pe,l{-Vl)] 

+ P 2 [(1 - Pe,2)V2 + Pe.2(-1^2)] - C - dS 
= PiViil - 2Pe,l) + P 21 ^ 2(1 - 2Pe,2) -C-dS 
Ki PiVi[l - 2e-'^^^i] + P2V^2[1 - 2e-'^^^=] -c-dS 

• Expected payoff of the second firm: The expected payoff 
of the second firm is —dS regardless of its decoding 
probability of error, since the payoff of the transaction 
is zero. The second firm does not pay the transaction fee 
as it does not make any transaction. 

If the decision time of firms are T.4 = T and Tb, and 
also the signal-to-noise ratios are 2S'7i_^, 2 Sj 2 ,A, 2S'7i^b and 
2S'72 .b, then the expected payoff of firm A would be: 

t^a{T,Tb) 

= I{r < min{TB,ro}} [Pi 1^1 [1 - 

+ P2F2[1 - 2e-^'^-"^='-"] -c-dSA] 

+ I{r = min{TB,To}} [Piyi[l - 2 e -^^^^^-^]/2 
+ P2V2[1 - 2e-^^^T'^'^]/2 -c-dSA] 

+ I{T > min{TB, Pq}} [-dS'A] 

and symmetrically for the other firm. 

III. Optimizations 

Having defined the players, the strategies, and the payoffs 
in this standard form game, we can now consider some opti¬ 
mization problems, before turning to questions of equilibria. 


A. Optimal Decoding 

We study an optimal adaptive decoding strategy in which 
the threshold value and tradeoff point between two error types 
of the decoder depends on its chosen value of delay. 

Let the decoder choose its decoding threshold knowing the 
decision time: 

h\{TA,SA) = argmaxPiLi[l - 26 “^''^'"^^'''] 

h 

+ P2L2[1 - 26“^"'^'^^=''^] 
P 1 VI-P 2 V 2 

PiVi + (1 - 2TaSa)P2V2 

Note that this choice of threshold captures the tradeoff appear¬ 
ing due to different expected payoff of transmitted messages. 
The decoder can reduce the probability of error given the 
message was a “Buy” at the cost of a higher probability of 
error for the “Sell” messages. 

yl{T) = {l-h*{T,S*)f /2 
^*2{T) = {lph*{T,S*)f/2. 

We will also need to define the following functions for use in 
equilibrium analysis: 

Fa{Ta) 

= PiViil - 2 PeA,A) + ^2^2(1 - 2Pe.2,A) 

Ri PiLl[ 1 - -f P2L2[1 - 

Fb{Tb) 

= PiViil - 2Pe,l,B) + P2V2{1 - 2Pe,2.B) 

Ri PiLi[l - 2e"^®'^s7i*,B(rfl)] P2V2[1 - 2e“^'^'®'®^2*,BCrB)] 

B. Optimal Power Allocation 

The transmitter needs to determine the allocated signal- 
to-noise ratio ahead of time before the communication takes 
place, since this is typically not adaptable at short timescales. 
We further assume the signal-to-noise ratio is determined 
not knowing the actual decision time. Thus, the allocated 
signal-to-noise ratio is assumed to be optimal for a chosen 
decision time. We assume that the chosen decision time which 
determines the signal-to-noise ratio is the time market becomes 
efficient, Tq. 

S* = arg max PiLi[l - 2 e“^“^^i ]/2 
S<So 

+ P2V2{1 - 2e-^®T'=]/2 -c-dS 

where 

, ^ PiVi - P 2 V 2 

PiVi + (1 - 2ToS)P2V2 

and 7i and 72 are defined as in ([T]) and (|2|i. 








IV. Equilibria 


Now we consider Nash equilibria for the game. We observe 
that the expected payoff of firm A when it chooses the optimal 
decoder is the following: 

'^a{Ta,Tb) = 1{Ta < min{rB,To}} [Fa{Ta) -c- (ISa] 

+ 1{Ta = min{rs, To}} [Fa{Ta)/ 2 - c - 
+ I{Ta > min{rB,To}} 

Since Fa{Ta) is a strictly increasing function, given Tb 
and To, the optimal strategy for firm A is as follows: 

TTb + 1, c > max{F a (Tb - 1), Fa(Tb)/2} 
TX{Tb) = <^ Tb - 1, Fa{Tb - 1) > max{c,F a(Tb)/2} 
[tb, Fa(Tb)/ 2 > max{FA(TB - 1 ),c} 

(4) 

Similarly for firm B. Now we try to find the equilibrium of 
this game in the symmetric case. In this scenario, the functions 
Fa(-) and Fb(-) behave similarly. 

We are interested in identifying the long-term behavior of 
the system when this game is repeated many times. Thus, we 
assume the agents follow the best response dynamics defined 
as follows: the game is repeated many times and each firm at 
each repetition of the game chooses the best strategy based 
on the strategy of the competitor in the previous repetition. 
Conditioned on the monotonicity of the payoff function, when 
there exists a Nash equilibrium in the game, the best response 
dynamics converges ifT^ . 

We observe we can expect two types of behavior from 
this game depending on the regime in which parameters are 
defined: 

TIE Eor some T*, there is Nash equilibrium (Ta,Tb) = 
(T*, T*) and (Ta, Tb) = {T*, T*). This is a valid Nash 
equilibrium of the game if the following properties hold: 

F(T*)/2 > max{F(T* - l),c} (5) 

WIN Eor some T*, we have two Nash equilibria (Ta,Tb) = 
(T* + 1,T*) and (Ta,Tb) = (T*,T* + 1). This is 
a valid Nash equilibrium of the game if the following 
properties hold: 

c < F(T*) < 2c 

F{T* + 1) < 2F(T*) (6) 

F(T* - 1) < c 

We first show that the TIE conditions and the WIN conditions 
partition the possibilities. Note that the monotonicity condition 
of the lemma is clearly true under optimal decoding: more ob¬ 
servations lead to lower eiTor probabilities and higher expected 
payoffs. 

Lemma 1. Let us assume function F(-) is strictly increasing 
and F(0) < c < F(To). Then there exists a 0 < T* < Tq 
that satisfies either the condition set given in equation © or 
the condition set given in equation ®. 


Proof: Let us look at T* such that F(T* — 1) < c < 
F{T*). We can observe simply that if F(T*) > 2c, then 
F(T*)/2 > max{F(T* - l),c}. Similarly if F{T* + 1) > 
2F(T*), then F{T* + l)/2 > max{F(T*),c} 

■ 

Now we prove the equilibrium properties under the TIE and 
(ISa] conditions, respectively. 

Theorem 1. Suppose that both agents start the game for some 
(Ta, Tb), the repeated game under the best response dynamics 
converges to one of the possible Nash equilibria. If the set of 
conditions given in equation © is satisfied for some T*, the 
agents converge to simultaneous decoding at time T*. If the 
set of conditions given in equation © is satisfied for some T*, 
depending on the starting point, they converge to {Ta,Tb) = 
{Tf T* + 1) or {Ta, Tb) = {T* + 1, T*). 

Proof: We can construct the best response state graph of 
the game. Each state in this graph corresponds to one possible 
strategy set. The transitions correspond to better/best responses 
of any one of the players to the strategy given in the previous 
state (in each transition, we assume that only one player can 
change its strategy). The transitions could be determined from 
the equations given in @. The pure state Nash equilibria of 
a game is the sink states of this state graph as no player can 
unilaterally improve his payoff in these states, the sink states 
of this state graph. We could have two possible types of state 
graphs depending which of these equations © and are 
satisfied. Due to the monotonicity of the function F(-), in any 
case this state graph is acyclic and starting from any state, it 
converges to one of the sink states. ■ 

Notice that the TIE conditions specify that the payoff in¬ 
creases faster than exponentially as a function of time, whereas 
the WIN conditions specify that the payoff increases slower 
than exponentially as a function of time. Since the quality of 
the channel determines the rate at which the error probability 
decays/ payoff increases, it seems that the stronger channels 
would impose the TIE result between the agents, whereas the 
weaker channels would impose inequality in the results of the 
game.is possible to win a trading race (in equilibrium) when 
the channel is not so noisy. 

V. Non-identical Channels 

Thus far we have been discussing games where the noise 
powers for the two players are identically Nq. Eor complete¬ 
ness, let us state what happens when the channels do not have 
identical statistics. If different firms have channels of different 
quality, the signal-to-noise ratio and the optimal transmitted 
power would be different for each firm. The expected payoff of 
each firm still follows the equation ©. The Nash equilibrium 
of this game will be asymmetric in the following sense. Let 
us assume that Fg^{c) < Ff^{c). The equilibrium will be 
{Ta,Tb) = (L^a^(c)J + 1, [F^^(c)J). This equilibrium is 
intuitive as we expect the firm with the better (or cheaper) 
communication channel to be more powerful and exploit 
the arbitrage in the market. This is what is observed with 


(3) 


an incredibly expensive arms race to build better physical 
communication channels. 

VI. Conclusion 

We have formulated a stylized model of the communication 
race that forms the heart of low-latency trading to take advan¬ 
tage of latency arbitrage opportunities in financial markets. 
We found the existence of Nash equilibrium communication 
strategies: for one set of channel parameters, there is a unique 
equilibrium that corresponds to ties where the two firms 
share the arbitrage opportunity. For another set of channel 
parameters, there are two possible equilibrium when one or 
the other firm wins the opportunity. 

Our modeling approach cast time as discrete, but one 
might wonder what happens in continuous-time models of 
communication races. Rather than settling into Nash equilibria, 
best response dynamics may go into a limit cycle since it is 
possible to just act more quickly than one’s opponent; the 
discontinuity prevents equilibria for the real-valued strategy 
space. Even within a discrete-time model that comes from 
continuous time, the relationship between clocking speed and 
noise could make clocking a part of the strategy space. 

Herein we have used various simplifications that can be 
relaxed in future work. We can consider markets with more 
than two messages; advanced methods in coding theory would 
then become important rather than simply having BPSK repe¬ 
tition. Moreover, microwave links that are now deployed incur 
fading in addition to noise; the role of outage in trading could 
be intriguing. Considering the game with many competitive 
firms rather than just two is another possible extension to the 
basic framework. 

Finally, we have focused here on expected payoffs, as 
is typical in game theory, but in many financial settings, 
risk is also a strong consideration. By considering risk in 
addition to expected payoff as part of performance criteria, 
new optimizations and equilibria may arise. 
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